The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Rough set theory is an effective approach to imprecision, vagueness and incompleteness in classification analysis and knowledge discovery. Attribute reduction and relative attribute reduction are the core of KDD. From the point of view of information, the basic concepts of rough set were analyzed in this paper. A novel attribute reduction algorithm was constructed by adopting conditional entropy and...
Mining the popular SMS messages in a short period of time is very valuable. However, traditional OLAP-based mining method is not suitable for this very large scale dataset. In this paper, we present a mining approach based on Map-Reduce parallel framework: Firstly, original dataset is pre-processed and grouped by the senders' mobile numbers. Secondly, we do a transformation to regroup the dataset...
A new thresholding algorithm and its multi-threshold extension are presented to improve the performance of image segmentation. The probabilistic rough set is used to portray the objects and backgrounds in image. The optimal threshold is given through a minimum boundary criterion. A new rough membership function is defined in the light of requirements of image thresholding.
Mining association rules is an important field in data mining. The article discussed a graph-based association mining algorithm, which directly generate frequent candidate itemsets through constructing directed graphs to form association rules. But this algorithm occupy a great deal of time for checking the candidate itemsets, so an improved algorithm proposed. The improved algorithm utilize the method...
Construction project cost forecasting is a key procedure to the mangement project. An accurate forecast can support the investment decision and ensure the project's feasible at the minimal cost. So reasonable determining and controlling the project cost become the most important task in the budget management of the construction project. A novel regression technique, called Support Vector Machines...
This paper presents an ACO-based (ant colony optimization) mining algorithm aiming to discover longer rule-chains directly. Firstly, a potential association rule directed graph (PAGraph) is created, in which, the dynamic heuristics is used to record participant-intensity of edge. Secondly, making use of ant's positive feedback, pheromone on edge that ants passed is adjusted by heuristics so that it...
Tree similarity measurement is key to tree-like data mining. In order to maximally capture common information between trees, we consider the problem of computing all common embedded subtrees, and advocate using the number/count of all common embedded subtrees as a measure of similarity. This problem is not trivial due to the inherent complexity of trees and the ensued large search space. The problem...
The parameter estimations are unstable when the determinant of the coefficient matrix of the normal equation is closed to 0 in least-squares estimation. The deviation of estimator is too great because of rounding error of calculator and it is hard to get the precise inverse of the coefficient matrix. A matrix function which is matrix power series was introduced in proposed method based on ridge estimation...
In this paper we address the problem of mining frequent closed itemsets in a highly distributed setting. The extraction of distributed frequent (close) itemsets is an important task in data mining. The paper shows how frequent closed itemsets, mined independently in each site, can be merged in order to derive globally frequent closed itemsets. Unfortunately, as distributed setting is various, it is...
This paper presents a method of estimating vehicle states using an Unscented Kalman filter (UKF). The UKF developed estimates Vehicle motion, such as yaw rate and side slip angle, from the noisy measurement set. The vehicle state estimation using a non-linear vehicle model with Unitire tire model will be compared to the measured state which is subjected to the same tests, in order to validate the...
This paper focused on computing the feature core of rough set theory by making full use of the aggregation information in a data cube. After we established a one-to-one mapping relation between equivalence classes in a decision table and nonempty cells in a data cube, a new cube-based algorithm for computing the feature core of a consistent decision table was put forward in this paper. The correctness...
Proper orthogonal decomposition (POD) method could decompose fluctuating wind pressure into proper mode space-related and principal coordinate time-related. For predicting wind field on structure's surface, this paper worked out a Matlab program decomposing & restructuring wind pressure time interval sequence, which realized its interfacing with the Surfer software whose space interpolation capacity...
According to the different proportion of unknown and uncertain information to the entropy, the present measurements for vague set were divided into three groups. Tow new measurements were proposed according to the proportion as well. Finally A weighted measure of entropy for vague set was introduced. Several data and examples were used to test the measurements above, which demonstrates the different...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.